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Abstract —In this paper, we propose a novel formulation 
to understand the tradeoff between channel utility, meta 
information leakage and hypotheses distinguishability. As an 
example, such problems could emerge when online users aim 
to hide some level of information about themselves while 
revealing enough to be distinguishable from others. 

Under the framework, we formulate two constrained 
optimization problems. In these problems, the goal is to 
maximize binning utility while restraining a certain level 
of information leakage and to maximize the measure of 
distinguishability between two hypotheses while restraining a 
certain level of information leakage in the former and latter 
cases respectively. In both cases, we accomplish this goal by 
properly dividing a set of M random items into N bins. Such 
problems (where optimization is carried out over a diverse 
but yet dependent series of sets) are formally known as multi¬ 
agent multi-variate optimization problems. 

Both of the constrained optimization problems are NP hard 
by nature. We then propose suboptimal solutions to exploit 
potential sub-modular structures of the problems based upon 
sufficient conditions found on the payoff functions in the 
underlying objective functions. Further numerical results are 
presented to demonstrate the effectiveness of the proposed 
sub-modular function based algorithms. 

Index Terms —partition, information leakage, privacy, mu¬ 
tual information, hypotheses distinguishability, submodularity 


I. Introduction 

In our daily use of internet there are two very specific 
issues of privacy. The first issue rises when we aim to 
hide as much vital information about ourselves while still 
enjoying the service of web -which is always prone to 
eavesdrops. As an example, we hope to use the Google 
search engine without a third party studying our search 
history and learning about us. For future references, we 
refer to this class of problems as Utility versus Privacy 
(UvP). 

The second scenario rises when we wish to be distin¬ 
guished from other users but not reveal all the information 
we share on the web. As an example, when we surf Ama¬ 
zon, based upon our previous searches over the website, we 
are offered new suggestions we may enjoy. However, we 
hope to keep our privacy intact to some level at the same 
time. If we search for a specific political ideology, we do 
not like to be suggested a book on such an idea. If we were, 
we would feel uneasy about how much private information 
about us has leaked. This class of problems will be referred 
to as Utility versus Privacy and Distinguishability (UvPD). 


It could then be deduced that in both scenarios we aim 
to maximize a utility reflected by our browses (a good 
internet service in the first case and suggestion of a relating 
product in the second) while keeping the privacy reflected 
by patterns shown in sequence of webpages visited and 
obscured by the underlying browses to a certain upper 
bound. 

It could be argued that the best solution for such invasion 
of privacy problems for a user is to find approaches seeking 
tradeoff between some utility and the resulting privacy 
leakage. 

One method to carry this out, is the use of website 
proxies. Such websites offer the user complete anonymity 
by coding the IP address of the visited web pages into a 
series of characters which can no longer be traced. 

As an example if a user visits the proxy website 
www.proxy.com, they will see a URL box where they could 
enter the name of their intended web page. The proxy 
website will then open the new web page as an extension 
of its own URL, meaning in the address bar, there will 
be no trace of the desired web page. Furthermore, even if 
the eavesdropper attains the new link in the address bar, 
they cannot simply revisit the website as the new address 
is created only for the momentary accommodation of the 
user and will be destroyed as soon as the user is done with 
the URL. 

The only detriment to such proxy websites is their 
limited bandwidth and time-outs. As a solution, we can 
employ multiple proxy websites and thus cut down on the 
utility loss. However, by using multiple proxy sites we are 
allowing a level of privacy breach into our browses where 
the eavesdropper is able to deduce some information about 
our tendencies by observing the distribution of used web¬ 
pages over multiple networks. Then, if a utility function 
based upon connection speed -bandwidth- for the user is 
calculable, a privacy constrained problem between the user 
and an eavesdropper (for example a service provider) could 
be defined. 

To formalize such trade-off problems, we propose a 
partitioning framework in which a set of M items (e.g. 
websites) are placed into N bins (e.g. proxy sites) to 
maximize certain payoff function while meeting constraints 
imposed due to concerns of privacy leakage or distinguish¬ 
ing capability. Then, a given distribution G\, G2, ... on the 
random items represents a particular operating mode or 
behavior over a larger scale, which the user is willing to 
be classified. Under a particular distribution, however, a 



user is more concerned of the sequence patterns, which 
could be inferred to certain extent by the resulting patterns 
of bins (e.g. proxy sites) deployed. On top of these two 
issues, a utility function for each bin is employed to reflect 
the throughput for this particular bin. 

Our overarching goal is to pose and then solve the un¬ 
derlying constrained combinatorial optimization problems 
by first considering the tradeoff between meta leakage and 
binning utility, and then to further add the distinguishability 
into the objective by seeking optimal binning of items. 

Due to the exponentially growing cost in searching 
such partitions, we instead seek sufficient conditions under 
which polynomial order complexity algorithms exist by 
resorting to multi-submodular and submodular structures 
in our problems. 

A. Related Works 

The concept of privacy has already been explored in 
many works such as [1], [2] where a general but non- 
mathematical explanation was offered. However; in our 
work, we go into further details as to what privacy rep¬ 
resents in our framework and how it could be formulated 
into many settings. 

As for the addition of distinguishability, to the best of our 
knowledge, the closest work to ours was done by [3] where 
they considered the tradeoff between distinguishability 
and information leakage when the former is quantified 
using KL-distance and the latter using mutual information. 
Lately, [4] also considered a problem close to ours where 
they used the concept of missed detection and false alarm 
as the basis of hypotheses testing. However, in both works, 
the authors were interested in a randomized binning process 
which tended to be more complex than our routine. In both 
cases, this came at the cost of sacrificing the concept of 
trade-off by only allowing very small privacy leakage (less 
than e) in order to invoke a first order approximation to 
reduce the computational complexity of the optimization 
problems. However; in our work, we are concerned with 
any amount of trade-off with deterministic binning to seek 
suboptimal and polynomial order approximations enabled 
by multi-submodular set function conditions. 

The multi-submodular function was discussed exten¬ 
sively in [5] where they introduced a series of sufficient 
conditions on multi-submodular set functions by which 
the multi-submodular problem could be transformed into a 
submodular set function problem. Then, further discussions 
about the existence of a solution to the new problem 
were made. By doing so -and if a solution were proven 
to exist-, the complexity of the problem could be shown 
to be reduced from NP to polynomial. Such sufficient 
conditions have been adopted by us in seeking multi- 
submodular solutions in our problem settings. 

This paper represents an expansion of our previous 
works in [6], [7] where in [6] we had a goal of finding 
the average time required to detect one of multiple non¬ 
overlapping specific subgraphs. This goal was carried out 
by finding the output of the network to a set of pre-specified 
queries. 

In this paper, we shift our attention from detection of 
an active subgraph to seeking tradeoff between optimizing 


utility of partitioning a set of random items and restraining 
information leakage. 

B. Paper Contributions 

It is important to note that some of the problems in¬ 
troduced in this paper, were partially considered in our 
previous work [8]. However, due to page limitations, the 
mathematical derivation of the results was left out. 

As for the novel problem addressed in this paper, our 
goal is to offer (1) a measure of distinguishability between 
the K = 2 hypothesis; (2) a measure of average leaked 
information given any of the K hypothesis is active; (3) 
a formulation of a multi-agent multi-variant optimization 
problem with a privacy leakage constraint; (4) the devel¬ 
opment of insight into the complexity of such an A'/Miard 
problem and further sufficient conditions under which we 
can simplify it into a polynomial problem; (5) a description 
of the algorithm utilized to find the solution given the 
sufficient conditions followed by the proximity results and 
finally (6) numerical examples to further demonstrate the 
applicability of submodular solutions, as compared with 
the results using exhaustive search in a small scale setting. 

C. Paper Organization 

The rest of this paper is organized as follows. In Section 
II we formulate the problems in terms of privacy and utility 
functions. We dissect what the goal and the constraints are. 
In Section III, we inspect the overall utility functions of 
each problem and then find the sufficient conditions under 
which they are equipped with multi-submodular property. 
In Section IV an algorithm for each case is developed 
offering a polynomial complexity and an accuracy level 
of e or { depending on whether the goal is to minimize or 
maximize the overall utility function respectively. Finally, 
in Section VII, we conclude the chapter. 

II. System Model 

First, we propose an abstract framework to formalize the 
goal of seeking partition of M items into N bins. More 
specifically, we aim to allocate each one of 1 < i < M 
possible items (queries) to one of N output bins. There 
could be at most N M such partitions. It follows that any 
set allocation Ai, 1 < l < N M results in N sets C 
{1, 2, M}, j = 1, 2,iV. Each such set is defined as 

-sf = Wij = i} 

whe " "« = {o !*$> (1) 

We further assume fl =$,j^k. Furthermore we 
have (J^_i = {1,2Finally the size of each 

such set S^ 1 is defined as L^ l \ 

A. Probabilistic Model 

We assume at any time slot one and only one of 
the inputs is chosen with a certain probability. Thus, 
assuming hypotheses G p is active and if we use variable 
X £ {1,2as a representation of set of items, we 
could have P(X = i\G p ) = P{ 7 » p = 1) = n ijP ,l < 
i < M,,p £ {1,2} as a representation of the probability 
of choosing item i from the set X under hypotheses G p 



where 7 ip £ {0,1}. It further follows that 1 7 % p = 

l,p G {1, 2}, stipulating that one and only one of M items 
is selected. 

Next, we introduce an observable random variable Y £ 
{1,2,-•• , TV}, denoting the index of the bin (the proxy 
site) employed to carry one of the M > N items. It follows 
that the probability of each bin’s appearance given a set 
allocation scheme such as Ai under hypotheses G p will be 
equal to 

P(Y=j\A l ,G p ) 

M 

= Y^P(Y = j\A u X = i, G p )P(X = i\A t ,G p ) 

i—1 

M 

= Y J P{Y =j\A u X=i)P{X=i\G p ) (2) 

2 = 1 

Furthermore, P(Y = j\Ai,X = i) = of) G {0,, 1}. It 
thus follows that 

M 

P(Y = j\Ai,G p ) = 0$P(X = i\G p ) 

2=1 

P(Y = j\A u G p ) = £ *i r =<*? p) ( 3 ) 

iesf 

B. Revealed Information 

By choosing to allocate M items to N bins where 
N < M, we have injected ambiguity and uncertainty into 
the output binning index sequence about the input item 
sequence over a successive n visits or channel uses. In 
other words, if we originally chose to transmit n of such 
item s, our total set of possible sequences would be of form 
X n = [A'i X‘ 2 ...X rl } out of M n possible outcomes. From an 
observer’s perspective which can only have access to which 
one of N bins is deployed in each time slot, sequences in 
the form of Y 1 ' = [Vj Y 2 ...V' n ] has cardinality of at most 
N n < M". Despite the amount of uncertainty added due 
to the many-to-one mapping between items and bins, the 
output sequence sill reveals certain amount of information 
regarding the patterns of sequences of M random items. 

This observation could be further studied by indicating 
how our allocation system resembles a coding framework 
where we have an equivalent channel whose input variable 
is X and output V, as show in Figure 1. 



Figure 1. Coding Channel Representation of the Problem 

Under such a framework, the equivalent channel output 
sequence Y can help an eavesdropper classify the input 
sequence X into a number of differential classes. As a 


result, information about the specific input item patterns 
is leaked to certain degree and can be measured using 
conditional mutual information I(X;Y\Ai,G p ) between 
X and Y, under a hypotheses G p given a particular binning 
(i.e. partition Ai relationship as illustrated in Figure 1. 

Such conditional mutual information thus measures the 
maximum number of bits of meta information about item 
sequence per channel use. Therefore, we can have at most 
2 m(x-Y\A l ,G p ) se q Uences X ra distinguishable by inferring 
based on Y”. We thus adopt I(X-Y\Ai, G p ) as the privacy 
metric conditioned on a particular partition mapping Ai 
under a specific hypotheses G p . It follows that due to the 
combinatorial nature of a set allocation problem there are 
a total of N m possible methods to allocate these M items 
to the N sets. 

We can formulate the mutual information over a set 
allocation Ai,l < l < N M under hypotheses G p as: 

I(X;Y\A i ,G p ) = H(X\A l ,G p )-H{X\Y,A 1 ,G p ) = 
H(Y\A h G p ) - H(Y\X,A h G p ) = H(Y\A h G p ) = 

H(a«’ p \a«' p \...,a% p) ) (4) 

where we have used the notion of TT(ai, a 2 , a m ) = 
~ Y^™=i a v\°& a v an d the fact that H(Y\X,Ai,G p ) = 0 
because if the input X and the channel scheme Ai and 
hypotheses G p are known, then output Y will offer no 
uncertainty. 

C. Distinguishability Measure 

In this section, we aim to define a measure which 
can evaluate the distinguishability between a number of 
multinomial distributions. 

Given that a certain set allocation Ai has been chosen, 
depending on the choice of G p , two different distributions 
defined by using variables could be developed where 
we have: 

af p) = P(Y = j\A h G p ),p £ {1,2}, j £ (5) 

where the two probability sets Pi = = 

1,2and P 2 = = 1,2,...,TV} represent 

the multinomial distributions under allocation scheme l 
given two prior probability distributions G 1 and G 2 which 
we aim to distinguish. To evaluate the distinguishability 
between two such distributions, we utilize the definition 
of symmetric KL divergence. We would have the measure 
defined as: 

1 -{KL{Pi\\Pi)+KL{P 2 \\Pi)} 

= ^ ( q j U) ~ ’ 2) )( lo S - l°g ( 6 ) 

Z j=i 

We could now use the above definition in developing our 
problem formulation. 

It is important to note how this measure of distinguisha¬ 
bility is different in nature than that of leaked information. 
Through binning, a sequence of outputs are produced 
which we aim to limit. However, we could use an average 
(note the average nature of KL distance) of these sequences 
over multiple choices as a measure of maximum likelihood 
detection deciding which hypotheses is truly active. By 




doing so, we are both controlling the level of leaked 
information and utilizing it for further gain. 

D. Problem Formulation 


Using the definitions addressed above, we could show 
that the UvP problem could be modeled in the form of 
finding the allocation scheme Ai which satisfies [8] 



where fj(Sj 1,1 ^) represents the utility function offered 
by the j th bin while using allocation scheme A / under 
G p hypotheses and A > 0 demonstrates the Lagrange 
multiplier accounting for the leaked information constraint. 
Equation (7) is simply a sum of functions defined over 
a series of sets. We refer to these as multi-variate set 
functions seeing as how their values are based upon spe¬ 
cific sets and variables introduced in each of these sets. 
Furthermore, we note that since there is only one active 
hypotheses Gi in this scenario, we can have = a.I" 1 

and f j (S?’ 1) ) = f j (S? ) ). 

We then aim to distinguish these two distributions. We 
thus choose binning as our method of observation meaning 
we attempt to find an M-to-N mapping between the set 
of M items and set of N outputs which can best help us 
distinguish which multinomial distribution was chosen. 

Overall, we aim to find a partition of M items under 
which we can maximize the binning utility of the solution 
while keeping the distinguishability between G i and Gi 
and the leaked information about the item probabilities 
higher and less than some thresholds respectively. 

We next pose our problem as a constrained optimization 
problem: 



holds true. Furthermore, Ai,A 2 > 0 are Lagrange multi¬ 
pliers accounting for the constraints over distinguishability 
and leaked information respectively. 

III. Multi-Submodular Set Functions as a 
Means of Solution 

As mentioned previously, both UvP and UvPD are NP- 
complicated (they are solved when searches over N M and 
2 m possible set allocations are carried out respectively and 
the best allocation is chosen). Still, we could opt to utilize 
the definition of multi-submodular set functions so as to 
reduce the complexity to that of polynomial at the cost of 
accuracy. 


A. Imposing Multi-submodularity 

In [5], it was shown that if we can prove multi¬ 
submodularity for functions such as those formulated in 
Eq. (7), then they could be modeled as simpler problems 
(submodular set functions). We thus, aim to find the 
sufficient conditions for such occurrence. To do so, we first 
offer a review of multi-submodularity. 

As in [5], if we define M = {1, 2,..., M}. then a multi¬ 
variate function F : (2 M ) N —> K+ is multi-submodular if 
for all pairs of tuples (Si,..., Sn) and (T), ...Tjv) € (2 M ) N 
we will have: 

F(S U ..... S N ) + F(Ti,..., T n ) > F(S 1 U Ti,..., S N U T N ) 
+F(SxDT 1 ,...,S N nT N ) (9) 

Since in our formulation functions are separately defined 
on different sets, the condition in Eq. (9) is simplified 
to the sufficient condition of submodularity of F;. l> = 
OLp fj ( Sj^ ) + Xa^ log ) for all sets Sj. We thus need 

to find the sufficient condition for submodularity of F ( - 11 
when defined over a set Sj. 

As for the UvPD problem, we already are aware that all 
we needed to impose was submodularity over the overall 
utility function. 

B. Imposing Separate Submodularities for UvP 

For an easier mathematical representation of the follow¬ 
ing derivation we denote F(Sj) = F^ l \ Furthermore, we 
denote fj(S y^) = Both these denotations allude 

to the fact that once a set allocation Ai is chosen, its index 
could be dropped. 

In the next step, we opt to use diminishing return 
property as the means of making certain each of these 
functions are submodular. Following is a definition of 
diminishing returns for submodular functions, after which 
we derive the sufficient conditions for the case discussed 
in UvP). 

Diminishing Property Return dictates that if we define 
S as the universal set, a set function F : 2 s —> R + is 
submodular if, for all A,B C S with A C B and for each 
x € S — B we have [9]: 

F(Au{x})-F{A)>F(BU{x})-F(B) (10) 

Now we attempt to expand Eq. (10) for each F(Sj). 
However, to properly do so, we first need to account for 
the behavior of this function. 

We have defined F(Sj) = ctjf(Sj)+\aj log (ay) where 
it seems that the function has a singular relationship with 
set Sj. However; there is a secondary relationship the func¬ 
tion shares with the set S*f = S — Sj where S = S repre¬ 
sents the universal set. This relationship could be modeled 
as F(Sf) = (1 - fr)f(S - Sf) + A(1 - fij) log (1 - ft) 
where we have used the fact that f3j = 1 — ay seeing as 
how we define 

Thus, for any set Sj, we must find the sufficient 
conditions for the existence of diminishing property for 





both functions F 1 (Sj) = ajf(Sj) + \a :j log (a ? ) and 
F 2 (Sj) = (1 - aj)f(S - Sj ) + A(1 - log (1 - aj). 

To do so, we will evaluate their sufficient conditions and 
then find their intersection as the final conditions (assuming 
they do not negate one another). 

Note: For any further references, we first need to address 
a series of variable and function definitions which are going 
to play a vital role in the rest of this chapter: 

Definitions for Problem 1 

1. Any variable represented with a capital Letter repre¬ 
sents a set. 

2. Any variable represented with a small letter repre¬ 
sents an element. 

3. A — B represents a set containing all elements of set 
A which do not appear in set B. 

4. a x represents the probability of item x and cxa 
represents the sum of probabilities of items mapped 
into a set A. 

5. cx ba represents the difference in the sum of proba¬ 
bilities of items mapped into the sets B and A which 
could be further shown as cxba = cub — cxa- 

6. g(C, D) represents the 1 st order difference of a set 
function /(C) from f(C — D) where D C C which 
could be formulated as g(C, D) = f(C) — f{C—D). 

7. q(C, Ci, D, Di) represents the 2 nd order difference 
of a set function /(C) where Ci C C and D i C 
D which could be formulated as q(C , Ci, D, D i) = 
g(C,D)-g(C 1 ,D 1 ). 

8. We assume the probability of items is sorted in a 
decreasing manner such as 7ri > it 2 > ... > tcm- 

Theorem III.l. The set functions iq (Sj) mid F 2 (Sj) and 
as a result F(Sj ) are submodular if 

(1) g{Sj,s w ) < 0 

(2) q(Sj,S u ,S w ,Sy) < 0 

(3) \g(Sj,S w )\> Alog(^) 

for all possible sets S w C Sj C S and S u C Sj and 
S y C S w where S is the universal set. 

The proof for this theorem is presented in the Appendix 
under Theorem III.l. In the proof, a series of sufficient 
conditions for either F\(Sj) and F 2 (Sj) are evaluated 
separately. This is done because although their conditions 
turn out to be the same, their derivations are vastly different 
and require separate discussions. We then acquire the more 
restrictive set of sufficient conditions between the two 
sets of sufficient conditions for either T) (Sj) or F 2 (Sj). 
In the proof, we rewrite inequality (10) for set function 
F-| (Sj), start factorizing a a, cub A separately and a x and 
1 together and impose sufficient conditions so that each of 
their coefficients is always positive. 

Unfortunately, [5] does not turn a multi-submodular 
problem to a submodular one; they simply prove that this 
could be done. Thus, in order to expand upon the idea of 
polynomial complexity of solution algorithms we opt to 
assume N = 2 and offer the reader the algorithm to deal 
with such a specific case. We then calculate the complexity 
imposed by the algorithm to further stress the benefits of 
using such an idea at the expense of accuracy. 

C. Specific Case of N = 2 

As mentioned previously, in order to show the applica¬ 
bility of submodular functions we choose to reiterate the 


overall utility function dictated in UvP for when N = 2: 

1 max^ a ( ^f(S[ l) ) + (1 - a^ ] )f(S - S[ l) ) 

+A (Ith + otl' 1 log (af } ) + (1 - a ( p } ) log (1 - a^)) 

= T(Si) (12) 

As can be seen, the problem is still exponentially com¬ 
plex seeing as how we need to search over 2 M possible 
solutions to find the optimal. Thus, once again we aim 
to impose multi-submodularity (in this case simplified to 
submodularity) on the new utility function. For the utility 
function above the same results derived for a general N 
could be used as a set of sufficient conditions. However, 
taking into account the joint relationship between the 2 sets 
and writing the same Inequality (10) for Equality (12) we 
are able to find a less restrictive set of sufficient conditions 
for the submodularity of this utility function as indicated 
below: 

Theorem III.2. When N = 2, the function in Eq. (12) is 
submodular if 

(1) g(Sf\S§)< 0 

(2) 2\g(Sf\s^)\>Wog(l + ^) 

(3) q (sf\s { J\s { J!;\s^)< 0 

for all possible sets Sw C C S and Su ^ C 
and Sy ^ C Sy,' 1 where S is the universal set. 

where we have deduce that the new set of sufficient 
conditions are less restrictive because they allow a smaller 
lower bound (a looser lower bound) for the absolute value 
of the first order derivative of the set function f(Sj). 

The proof for this theorem is presented in Appendix 
under theorem III.2. In the proof, we rewrite inequality 
(10) for set function described in Eq. (12), start factorizing 
ctA,ceBA separately and a x and 1 together and impose 
sufficient conditions so that each of their coefficients is 
always positive. 

D. Imposing Separate Submodularities for the UvPD 
Problem 

In this section, we aim to find sufficient conditions for 
multisubmodularity of the overall utility function described 
for the UvPD Problem. 

Once again, we use the intuition offered in the previous 
section meaning we aim to break down Eq.(8) into a sum 
of simpler set functions and then impose submodularity 
over all such sets and thus guarantee the overall multisub¬ 
modularity as well. Following the previous reasoning, we 
aim to find the sufficient conditions of submodularity for 

F(Sf) = P{Gj)[af l) f{Sf X) ) + \ 2 af l) logoff] 
+P(G 2 )[af [2) f(Sf 2) ) + A 2 af 2) logaf 2) ] 

+ y(aj i ’ 1) - o^’ 2) )(log a < J l,1) - log(13) 




for all sets Sj. Assuming that a specific allocation scheme 
Ai has been chosen, we can further simplify Eq.(14) by 
rewriting 

) = F(Sj) = P(G 1 )[al 1) f(S^ ) ) + \ 2 af log af] 
+P{G 2 )[af f{Sf ] ) + A 2 af log af] 

+ - log cty Z ' 2) ) (14) 

where we have simply dropped every iteration of l from 
our previous formulation for easier representation. 

As was the case previously, it seems that the function 
F{Sj) has a singular relationship with set Sj. However; 
there is a secondary relationship the function shares with 
the set S*f = S — Sj where S = § represents the universal 
set. This relationship could be modeled as 

F(Sf) = P(Gr)[(l - sf’ 1)C ) 

+A 2 (l-^’ 1) )log(l-^’ 1) )] 

+p(g 2 )[(i - - sf’ 2)C ) 

+A 2 (l-/3f 2) )log(l-/3f 2) )] 

+ Y^ l ’ 2) - ^• 1) )( 1 °S (! - P?) ~ log (1 - /?f)) 

(15) 

where we have used the fact that /3j = 1 — ay- seeing as 
how we define 

/3j l) = n ' 1 ( 16 ) 

ie{s-s™} 

Thus, for any set Sj , we must find the sufficient con¬ 
ditions for the existence of diminishing property for both 
functions described in Eq.(14) and Eq.(15). To do so, we 
will evaluate their sufficient conditions and then find their 
intersection as the final conditions (assuming they do not 
negate one another). 

Theorem III.3. The set functions defined for the UvPD 
problem is submodular if 


(1) q(Sj,S w ,S r ,S t ) < 0 

(2) g{Sj,S w )< 0 

(3) |2(S,-,SJ|> 


max{ A 2 log lo[ + 
A 2 log J 2 


Ai 


P{G\ 

Ai 


log {w[u/ 2 ), 


P{G 2 ) 

1 


log (Wjw'l)} 


TTMi 


TTjVf, 


for all possible sets S w C Sj and St C S r and S r C S r 


The proof for this theorem is presented in the appendix 
under Theorem III.3. Once again in the proof, a series 
of sufficient conditions for F(Sj) as formulated in either 
Eq.(14) or in Eq.(15) are evaluated separately. This is done 
because although their conditions turn out to be the same, 
their derivations are vastly different and require separate 
discussions. It follows that since both functions require 
the same set of sufficient conditions, the overall function 


F(Sj) which represents either representation being chosen, 
also follows the same set of sufficient conditions. 

It is important to note how adding distinguishability 
as a factor to account for has changed the dynamic of 
our sets of sufficient conditions to assure accessibility 
of a submodular solution for the problem. In the UvP 
formulation of the problem, A referred to the Lagrange 
multiplier accounting for the leaked information constraint. 
The same role is played by A 2 in the formulation of 
UvPD problem. Furthermore, in both problems, we aim to 
maximize a utility while keeping the leaked information 
to a threshold. This would mean that if there were no 
concerns of distinguishability in the UvPD problem, we 
would be expecting the same sets of sufficient condition 
as the two problems basically become the same. As can 
be seen, if we assume Ai (which represents the Lagrange 
multiplier accounting for the distinguishability measure) is 
very small, then the two sets of sufficient conditions turn 
out to be the same (noting that there is basically only one 
hypotheses to account for when distinguishability is of no 
importance). 


IV. Submodular Solution 
I n this section, we offer two algorithms specifying how 
each UvP and UvPD could be solved in a polynomial 
manner (when N = 2). We further expand upon how 
the complexity solutions has been compromised. Finally, 
we offer an example to showcase the relationship between 
solution error and complexity. 

A. Solution to UvP 

It is important to note that while all these papers dealt 
with the issue of constraints, they assumed much more 
complex constraints than we are dealing with in this 
discussion. Our only constraint is that a\ l> < a s where we 
assume h(a s ) = Ith, cx s < 0.5 which is obviously a down- 
monotone constraint. Thus, we can simply use the results 
from their work to create our own algorithm to find the 
submodular function solution to our problem. We present: 

Algorithm P. Submodular Function Solution to the UvP 
Problem) 

1. Let Si = arjmax egx= rj iM iT[Si = {e}] while 
|0.5-c4 M) | > 10.5 — a s |. 

2. If there is an element e £ X \ S\ such that 
T[Si + {e}] > T[Si] and |0.5 - otp - a e \ > 

10.5 — q s |, let Si = Si + {e}. 

3. If there is an element e £ Si such that T^Si \ {e}] > 

T[Si] and |0.5 - + a e \ > |0.5 - a s |, let Si = 

Si — {e}. Go to Step 2. 

4. Return maximum of T[Si] and T[X \ Si]. 


B. Solution to the UvPD Problem 

In this case our only constraint is that ai < a s where 
we assume h(a s ) = Ith,ot s < 0.5. Following the same 
pattern as that of Algorithm 1, we can write: 

Algorithm 2: Submodular Function Solution to the UvPD 
Problem 

1. Let Si = argmax eex= ^ lt M jT[Si = {e}] while 




U(S i = {e}) and W(Si = {e}) both satisfy the 
constraints. 

2. If there is an element e £ X \ Si such that 
T[Si + {e}] > T[Si] and U(Si + {e}) and W(Si + 
{e}) both satisfy the constraints, let Si = Si + {ei}. 

3. If there is an element e £ Si such that T[Si \ {e}] > 
T[Si] and U(Si — {e}) and W (Si — {e}) both satisfy 
the constraints, let Si = Si — {e}. Go to Step 2. 

4. Return maximum of T[Si] and T[X \ Si],. 


In both cases, we know that at the very last step T[Si] = 
T[X\5i]. Now we opt to calculate the complexities of this 
method. Steps 2 and 3 could repeat (M — 1) + (M — 2) + 
... + 1 = times each while every item could be 

removed and thus replaced a total of 2 M times. Thus the 
total complexity of steps 2 and 3 is equal to M 2 (M +1) = 
0(M 3 ). The complexity of step 1 is also equal to M. Thus 
the total complexity of the solution is equal to 0(M 3 ). 

These polynomial solutions simply make certain the 
maximal and minimal functions obtained are at least 0.432 
and at most 2.315 times the optimal objective functions 
respectively. This range of error occurs because in this 
method, we are removing and adding members from and 
to the set Si one by one. Thus, at each decision point 
we are making one locally optimal decision. However, 
it is widely known that a locally greedy method is not 
necessarily globally optimal [10]. 

Finally, we find it important to note that in both these 
algorithms we are considering the case when N = 2. This 
is because while the studies for general multi-agent multi- 
variable solutions are still ongoing, there has not yet been 
any closed form solution algorithm for a general value of 
N. This is due to combinatorial nature of such binning 
problems; when TV = 2, an item could either belong to 
one bin or the other and thus the idea of adding an item 
to a bin or removing it from a bin (as portrayed in either 
of the algorithms) is justifiable. However, when N > 2, 
there might be cases when no new item is added to a bin 
and instead any other two of the bins gain or lose items 
resulting in confusion while embedding an algorithm. 

V. Examples for the UvP and UvPD Problem 

In this section we aim to offer the reader an example 
where we aim to use the results gathered in previous 
theorems and Algorithms to first find a proper utility 
function given the specifics and then run and test the 
Algorithms’ results with that of an exhaustive search to 
compare the two methods in terms of complexity and exact¬ 
ness of the solution. As a general assumption, we assume 
M = 6, TV = 2, A = A 2 = 0.4, Ai = 0.6, I th = 0.47 
and that f(S) = /(IS]) where S and |S| represent any set 
and its cardinality respectively. We further assume the two 
following item probability distributions: 

7Ti i = 0.3, 7T2 ,i = 0.25,713,1 = 0.25, 

774.1 = 0 . 10 , 775,1 = 0 . 05 , 776,1 = 0 . 05 , 

771,2 = 0 . 2 , 772,2 = 0 . 2 , 773,2 = 0 . 2 , 

774.2 = 0.15, 775,2 = 0.15,776,2 = 0.10 (17) 

It is important to note why we have chosen a rather 
small amount of the value of M; we have done so to 


be able to carry out the exhaustive search for the optimal 
set selection by hand without much difficulty. Thus, the 
choice of a small value of M in no manner indicates a 
limitation of the overall results. One of the simplest utility 
functions which could satisfy the conditions presented 
in theorem 4.3.2 is a quadratic function in the form of 
/ P (|S|) = alS] 2 + b\S\ + C,p = 1,2. By calculating 
the I s * and 2 nd order derivatives we can see that they 
follow the form of .g p (|S'|) = 2a\S\ +b — a and g i ,(|S'|) = 
2 a respectively. We could then show that the function 
/ p (|S'|) = — |S'! 2 + 151 +44,p =1,2 satisfies all the sets 
of sufficient condition for imposing submodularity on both 
hypotheses. 

Note: In our current and following derivations of a de¬ 
sirable utility function we focus on developing a quadratic 
utility function. Such an assumption might appear to be 
extremely limiting to our class of functions at first sight. 
However, as is further explained in [11], such an assump¬ 
tion is quite understandable and rather desirable seeing as 
how in many economic models, functions are written as 
extensions of quadratic function and thus later calculations 
are simplified while still maintaining the essence of a utility 
function. 

A. Solution Comparison 

The results of running Algorithm 1 on each hypotheses 
have been gathered in Table I. Each cell represents the 
maximum overall utility achieved in either hypotheses 
by each method where it is obvious that the probability 
distribution plays a more vital role on the exactness of 
the solution compared to the utility function -which is the 
same in both cases. Also of interest is the loss of utility at a 
desirable complexity reduction from TVP to P which helps 
justify our persistence on utilizing submodular solutions. 


Table I 

Solution Exactness Comparison for UvP 


Hypotheses 

exhaustive search solution 

submodular solution 

1 

32.8124 

25.8124 

2 

25.8124 

25.8124 


Although we previously mentioned the possibility of a 
loss of accuracy, we still need to discuss the reason behind 
this disparity of results. While running the submodular 
algorithm for the first hypotheses, we need to both satisfy 
the leaked information constraint and maximize the utility 
function, this then results in the selection of the item with 
highest allowed probability ( 774,1 = 0.1). However, once 
this item is fixed into the solution set, we cannot add 
any more items, because the information leakage threshold 
is reached and we cannot remove any item because then 
the utility is lessened. However, if we were to run the 
exhaustive search method, we would see that choosing the 
two items with the lowest probabilities ( 775,1 = 776 ,1 = 
0.05) would have offered a better overall utility. Thus, the 
biggest limitation in submodular set solutions appears in 
the process of allocating the first item to the solution set. 

In the second hypotheses, such a problem does not occur 
since the first set solution choice already represents the best 
first choice. 



VI. Examples for the UvPD Problem 


References 


Using the same item probability distributions as those 
in Eq.(17), we can see that the major difference is in how 
this time the average leaked information needs to be less 
than the desired threshold. As a result, we can showcase 
two cases of the scenario with two random hypotheses 
where the submodular solution varies in accuracy; if we 
assume P(G\) = 0.99, E’(G , 2 ) = 0.01, we are basically 
offering a larger weight to the problematic scenario which 
results in a compromise in accuracy. Similarly, if we 
assume P(G* 2 ) = 0.99, P{G\) = 0.01, we are allowing 
more weight to the hypotheses with higher accuracy thus 
resulting in an optimal solution. We could thus deduce that 
accuracy in this class of problems has to do with both 
the hypotheses probabilities and each hypotheses’ item 
probability distributions. We follow through with two such 
hypotheses which result in one uniform utility function 
/ p (|Sj) = — |S'! 2 — 458|Sj + 2796. The results of these 
two hypotheses probabilities are gathered in the following 
table: 

A. Utility function for Table II 


Table II 

Solution Exactness Comparison for UvPD 


Hypotheses Selection 

exhaustive search solution 

submodular solution 

P(GA = 0.99 

1042.1 

757.3537 

p(g[) = 0.01 

757.3537 

757.3537 


It is important to note that other than each hypotheses’ 
probability another factor which could play a vital solution 
in the accuracy of the results is the value Ai which 
intensifies the distinguishability measure. In our examples, 
we specifically chose a rather small value for Ai so as to 
be able to witness the role of each hypotheses. However, 
if Ai was a non-negligible value, such a straightforward 
could not be deduced. 

VII. Conclusion and Future Works 

In this paper, we introduced and formulated two prob¬ 
lems widely regarded in online browses. While the first 
problem was briefly introduced in our previous work 
[8], we used this paper to expand upon the derivation 
of previous results. Furthermore, we expanded upon our 
previous problem by introducing the concept of hypotheses 
distinguishability thus realizing a fully novel problem. 
To carry out such goals, we formulated two separate 
utility function based upon the user’s utilization of the 
network. We showcased how each problem formulation 
results in a multi-agent multi-variate problem which is 
A'/'-complicated. We then introduced the concept of sub¬ 
modularity and multi-submodularity which help reduce 
the complexity of such problems to that of a polynomial 
at the cost of some accuracy. We derived a series of 
sufficient conditions which would guarantee the existence 
of a solution. Once the existence of such solutions was 
guaranteed, we introduced algorithms that could help us 
reduce the complexities while refraining an acceptable 
solution. Finally, we used examples to illustrate trade-off 
between accuracy and complexity of the solutions. 
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VIII. Appendices 

A. Theorem 111.1 

Proof For an easier understanding, our proof of the theo¬ 
rem is broken into two sections: 

Sufficient Conditions for Fi(Sj): 

Starting for F\(Sj), by rewriting Eq. (10) we will have: 

a B A[~g(B + {x}, B) + A log [——^-]] 

1 + a j 4 +a BA 

+a x [-g{B + {x}, A + {x}) + A log ^ \ BA ]] 

OLA+a x 

+a A [-q{B + {x},B,A + {x},A) + \log > 0 

' aj+o, 

(18) 

where we have used the definition of functions / and g. 

Since the above inequality needs to hold true for all 
possible sets of A C B, x B, we aim to determine the 
maximal amount enforced by the above set of inequali¬ 
ties. The first factorization results in two inequalities: (1) 
g{C,D) < 0 and (2) \g(C,D)\ > Amax(log (1 + |^)) 
for all sets D C C. To find the maximum of such a limit, 
we need to impose the one item with highest probability 
to { x } and assume the one lowest probability item to set 

B. Then the above inequality is maximized. 

The second factorization results in two inequal¬ 
ities: (1) g(C,D) < 0 and (2) \g(C,D)\ > 

Amax(log (1 + )) for all sets D C C. To find the 

maximum of such a limit, we need to impose the item with 



lowest probability to {x} and that ola = 0 and then have 

cxba = 1 — oc x = as- 

The third factorization could be simplified. The log¬ 
arithm argument consists of a nominator greater than 
denominator thus resulting in the overall logarithm argu¬ 
ment to be positive. We thus only need to impose that 
q{C, Ci, D, D\) < 0 for all sets C\ C C. D\ C D and 
DCC. 

It then follows that if the probability of each item 
is sorted in a decreasing manner as 7 Ti,...,7Tm> we 
can write the set of 3 sufficient conditions for sub¬ 
modularity of set function F- t = 1 as (1) 

g(C,D) < 0 (2) q(C,Ci,D,Di) < 0 (3) \g(C,D)\ > 
A log [max( 1 + ^,1 + ^“-)] = Alog(^) for all sets 
Ci C C, D x C D and DCC. 

Sufficient Conditions for F 2 (Sj): We now follow the 
same method for F‘ 2 (Sj),j = 1 by rewriting Eq. 

( 10 ): 


ceBA[-g{S — B,S — B — {x}) 

oa — Cuba ~ o x ^ 
+a x [-g{S - A - {x}, S - B - {x}) 
CUBA 


—A log (1 + -- 

1 - a a - 


-AIog(l + 


-)] 


1 — a A — o x — olba ' 
+a A [q(S — A,S — A — {x}, S — B,S — B — {x}) 

1 — OLA 1 — aA — OLBA - Ol x 


T A log (- 


-)] 


' 1 — ola ~ ol x 1 — ola — olba 
+1 [~q(S ~A,S~A~ {x}, S — B, S — B — {x}) 
-1 - OLA - Ol x 1 OLA olba 


+Alog(- 


1 — OLA 1 — — &BA — 


)]>0 (19) 


where we have used the definition of functions / and g. 

Since the above inequality needs to hold true for all 
possible sets of A C B, x ^ B, we aim to determine the 
maximal amount enforced by the above set of inequali¬ 
ties. The first factorization results in two inequalities: (1) 
9 (C,D) < 0 and (2) \g(C,D)\ > A log (1 J 
for all sets DCC. To find the maximum of such a limit, 
we need to impose the one item with highest probability 
to {x} and assume that is maximal while still less than 
1 — a x . This limit is imposed so that the denominator is 
not equal to 0. Then the lower bound for g(C,D) will be 
equal to Alog(l + £ ( ). 

The second factorization results in two inequal¬ 
ities: (1) g(C,D) < 0 and (2) \g(C,D)\ > 

A?nax(log (1 + )) f° r sets ^ C C. To find the 

maximum of such a limit, we once again need to impose 
the one item with highest probability to {x} and assume 
that a a = 0 and as is maximal while still less than \ —a x . 
This limit is once again imposed so that the denominator 
is not equal to 0. The secondary lower bound for g(C. D) 
will be equal to A log (1 + )• 

The third and forth factorization could be simplified. We 


could write them as 


[-1 + aAllglS 1 — A,S — A — {x}, S — B,S — B ~ {x}) 
1 - a a - ol x 1 — ola — aBA 

1 — OLA 1 — OLA — OLBA — Ol x 

+a A Alog(l)>0 (20) 


+Alog 


-}+ 


Furthermore, the logarithmic argument is always positive 
seeing as how the nominator of the fraction inside it is 
greater than the denominator. It once more thus follows 
that we merely need to impose q(C, Ci, D, D{) < 0 for 
all sets Ci C C,Di C D and DCC to help above 
inequality hold true. 

Thus 3 sufficient conditions for submodularity of 
F‘ 2 (Sj ), j = 1 ,...,N are developed: (1) g(C,D) < 0 (2) 
q(C,Ci,D,Di) < 0 (3) \g(C,D)\ > Alog(^) for all 
sets C 1 CC,D 1 C D and DCC. 

Finally, the intersection of the two sets of sufficient 
conditions for submodularity of Fi(Sj) and F 2 (Sj) gives 
us the sufficient conditions for submodularity of F(Sj) 
which turns out to be the same as either of theirs. 

□ 


B. theorem III.2 

Proof. For the specific case N = 2: 

F{A) = a A f(A ) + (1 - a A )f(S - A) + a A log {a a) 
+(1 - aA)log(l - a A ) (21) 

In such a case, diminishing return property requires that 


a B A{~g{B + {x}, B) - g(S - B,S - B - {x}) 

. ,, , ola + olba 1 ola olba ol x ., 

TA log (-------)}+ 

OLA + OLBA + OL x 1 — OLA — OLBA 

OL x {-g{B + {x}, A + {x}) - g(S -A- {x}, S — B — {x 

ola + ol x 1 — a a — ol x — as a , 

' OLA + OL x + OLBA 1 — OLA — Ol x 

+a A {~q(B + {x}, B,A + {x}, A) 

+q(S — A, S — A — {x}, S — B, S — B — {x}) + A 


+Alog (- 


l -)} 


log( 


OLA + OL x 1 — OLA ~ OL x — as A OLA + OLBA 

ola + ol x + olba 1 — ola ~ ol x ola 

OLA X j 
1 — OLA — OLBA 

+{-q(S — A, S — A — {x}, S — B, S — B — {x}) 

TA log (-- - -)} 

1 — OLA — Ol x — Ob A 1 — OLA 

> o (22) 


where we have used the definition of functions / and g. 

We aim to find sufficient conditions so that the above 
inequality can hold true. The first factorization is the 
coefficients of oba > 0. We thus need to ascertain that 
the coefficient is also positive. To do so, we impose that 


-2 g(C, D) + Alog( 


a a + oba 
a a + olba + o x 


1 ~ oa - Qba — Ox 
1 — a a — oba 


> 0 


(23) 




for all possible sets D C C. We thus need to impose two 
sufficient conditions: 

(1) g(C,D) < 0,VC 
(2) 2\g(C,D)\> 

max(| A log (-------)}|) 

a a + Cuba +«i 1 — a a ~ olba 

= A log K x (24) 


As for AYs value, it could be seen that 


r) (30) 


The second factorization is the coefficients of a x > 0. We 
once again need to ascertain that the coefficient is also 
positive. To do so, we impose that 

(1) g(C,D) < 0,VC 
(2) 2\g(C,D)\> 


max( | A log (- 


1 — a a — ol x — as a , 


1 — otA — at x 


K 2 = max log (1 + - - aA - t) (30) 

[a A + a x )(l - a B - a x ) 

over all possible combinations of cts and a x and a a where 
otB + o: x < 1 and a a < alphas■ We can use a variable 
a z = a a + a x and using the definition olba = ctB — o a, 
we can rewrite the definition of K> as: 

I< 2 = max log (1 H-————- -) (31) 

ct 2 (l a z (Aba) 

over all possible combinations of a z and as a where «s + 
a x < 1. We have thus managed to transform the definition 
of A '2 into a definition that is very similar to that of AYs. 
We then use the same reasoning to acknowledge that 

K 2 = log (1 + —— 5 —) (32) 


= A log K 2 


Finally, we will have: 


for all sets D C C. We will later determine what 
max{ AY Af 2 } entails. 

The 3 rd and 4 th factorizations could be merged together 
in the following manner: 

[-1 + oa]{-«(5 -A,S-A- {a:}, S-B,S-B- {x}) 

1 — a a — a x l — a.A — oiba^ 

+ log( 1 - z -)} 

+a A {-q(S - A, S - A - {x}, S — B, S — B — {x}) 
—q(B + { 2 }, B, A + { 2 }, A) 

+ log(----—)} (26) 

(A A OLA + Olba + OL x 

We can show that both A coefficients are always posi¬ 
tive. Thus, all we need to impose to guarantee the 3 rd 
and 4 th factorizations do not cause any ambiguities, is 
q(C, Ci, D, Di) < 0 for all sets C\ C C, Di C D and 
DCC. 

Finally, we need to calculate both values of A'i and K 2 
and determine the choice of maximum between them. It 
could be seen that 

K\ = max log (H-———--) (27) 

OLr\ 1 — OtB — OL x ) 

over all possible combinations of ae and a x where as + 
a x < 1. Thus, in order to find a global maximum over 
the surface entailed by aB and a x , we first need to check 
if the Hessian of the function is always negative (whether 
the function is concave). However, as it turns out, this is 
not necessarily true for all values of a/> and a x . We thus 
suffice to finding an upper bound for K\ which is easier 
to calculate. This action is justifiable since we are looking 
for sufficient conditions and thus they do not need to be 
tailor made tight. 

a x a x 

1 +- 7 ^--, < 1 +-— = L, (28) 

as(i — aB — at x ) kmOlb 

It could then be proven that the new function L\ is always 
concave and thus a global maximum could be calculated. 
This maximum occurs when a x = 7ti and ag = 7 tm and 
thus we would have: 

FY=log(l+^-) (29) 


max{Ki, K 2 } = log (1 


1 — ttm , 


Finally, the new set of sufficient conditions could be 
summed up as (1) g(C,D) < 0 (2) q{C 7 C±, D, Di) < 0 
(3) 2\g(C, D)\ > Alog(l + i^f JS£ ) for all sets C 1 C 
C, Di C D and D C C. 

C. theorem III.3 

By first imposing the diminishing returns property to 
F(Sj) as formulated in Eq.(14) we see it necessary to have: 

a Xl {-P(G 1 )g(S < ^ ) + {x}W,sW + {xjW) 
-Ai(log(aB 2 +a X2 ) - log («a 2 -hOi^)) 

— {P(Gi )\ 2 + Ai)(log (qbj + a Xl ) — log (a Al + an))} 

+OL X2 {—P(G 2 )g(S^ + + {2} (2) ) 

-At(log (a Bl +a Xl )~ log (a Al + a Xl )) 

- (P(G 2 ) A 2 + Ai)(log (qlb 2 +a X2 ) - log (ola 2 + a X2 ))} 

+a B A 1 {-P(Gi)g(S^ ) + {x}^\S^) 

- Ai (log (a fi2 + a X2 ) - log as 2 ) 

— (P(Gi)A 2 + Ai)(log (a Bl +a Xl ) - loga Bl )} 
+a B A 2 {-P(G 2 )g(S^ + {x}^,S^) 

-Ai(log (asi +a Xl )- log a Bl ) 

~(P(G 2 )X 2 + Ai)(log(«B 2 +a X2 ) - log q:b 2 )} 
+a Al {-P(Gi)q(S { B ) + {*}W,5^ 1) ,4 1) + W (1) ,^) 

, , , ola 2 OLb 2 + 0:3:2 , 


-Ai log ( 


QLA 2 + OC X2 OLB 2 
, , / OLA 1 Q:j3i + OL Xl 


— {P{G\)\ 2 + Ar) log (- f -^^l )} 

OLA 1 + Q!xi OL B 1 

+a A2 {-P(G 2 )q(S™ + {x}^,S^,S^ + {2}^Yi 2) ) 

, , , OL A 1 OL Bl + OL x 1 


-Ai log ( 


~(P(G 2 )\ 2 + Ar) log ( 


aAl + rfci a B 1 

OlA 2 olb 2 + OL X2 


OLA 2 + OL x 


)}>0 


(29) 



Now we attempt to find the sufficient conditions for 
Inequality (34) to hold true. 

If we assume q(Sj, S w , S r , S t ) < 0 ,S W C Sj,S t C 
S r , S r C Sj we will satisfy the positivity of cm, , cm 2 
coefficients. 

It could further be seen that if we define 


and 


wi = max{ 
u >2 = rnax{ 

Xi = max( 
X 2 = max( 


OL Bl 

+ 

&Xl 

CM, 

+ 

&Xi 

ole 2 

+ 

O/.X 2 

CXA 2 

+ 

&X2 

as 1 

+ 

(%Xl 

a B] 

L 

a B2 

+ 

&X 2 


a B2 


then as long as g(Sj,S w ) < 0, S w C S}- and 


max{ X -2 log wi 
A 2 log w 2 + 
A 2 logXi + 
A 2 log X 2 


l.9l> 

Ai 


P(Gi 
Ai 


log (W 1 W 2 ), 


^(G 2 ) 

Ai 

m) 

Ai 

P(G 2 ) 


log (UJ 2 U 1 ), 
log (X 1 X 2 ), 
log (X 2 X 1 )} 


U) 1 = 1 


OtBcti 
CM, + 


Wi = 


w 2 = 


7TM, 

1 

TlA In 


Similarly, it could be shown that 

OL x 


Xi = 1 


TTll 


Wl = 


W 2 = 


Furthermore, we could see that as long as M > 2, we will 
have oji > Xi and 0 J 2 > X 2 - We can thus rewrite the final 
set of sufficient conditions in the following manner: 

(1) q{Sj,S w ,S r ,S t ) < 0 

( 2 ) g(Sj,S w )< 0 

(3) 

Ai 


max{ X 2 logwi 


(35) 


P(Gi) 

Ai 


log (wiw 2 ), 


(36) 


(37) 


we will have satisfied the diminishing returns property in¬ 
equality. In the next step we attempt to calculate the values 
of wi,W 2 ,Xi an d X 2 - We further note that u>i,i £ {1,2} 
and Xjtj G { 1 , 2 } represent the same set functions under 
different hypotheses, we could thus find the maximum 
values of either lu, and Xj and then simply change the 
hypotheses index to derive the other. In the rest of this 
proof, we aim to find the maximal values of wi and Xi- 
It could be shown that 


(38) 


Thus, the maximum value of cur occurs when o /> 4 , and 
cm, + ol Xi are maximum and minimum respectively. This 
could occur when a Xl = 7 tm,, cm, = 0 and oba, = 
1 — tcmi- Then we would have: 

1 


(39) 


(40) 


-^-<1 +- 

a Bl ol Bi 

Thus, the maximum value of xi occurs when a Bl is 
minimum. This could occur when a Bl = 7Tm,- Then we 
would have: 

Tfl, + 7 TMi 
7rMi 

7Ti 2 + 7 Tm 2 


A 2 log UJ 2 + , , log (w 2 wi)} 

P\G 2 ) 

1 1 

,wi = -, w 2 = - 

7t m 2 

Now we aim to carry out the same process for F(Sj) as 
formulated in Eq.(15). By imposing the diminishing returns 
property, we see it necessary to have: 

ot Xl {—P(Gi)g(S - S i 1} - {*}«, S - 4 1} - M W ) 
-Ai(log (1 - cm 2 - a X2 ) - log (1 - a B2 - a X2 )) 
-(P(G 1 )A 2 + A 1 ) 

(log (1 - «Ai - a Xl ) - log (1 - a Bl - a*,))} 
+a X2 {-P(G 2 )g(S - - {x}^,S-S { ^ - {z} (2) ) 

-Ai(log(l - a Al - a Xl ) - log (1 - a Bl - a Xl )) 
-(P(G 2 ) Aa + Ar) 

(log (1 - «a 2 - a X2 ) - log (1 - «b 2 - a X2 ))} 

a BAl {~P(Gi)g(S - S%\S - 4 1} - {.x} (1) ) 

—Ai(log (1 - a B2 ) - log (1 - a B2 - a X2 )) 
-(P(Gr) A 2 + Ar) 

(log (1 - a Bl ) - log (1 - a Bl - «xi))} 
+a BA2 {-P(G 2 )g(S - S%\S- S™ - {*}W) 
-Ai(log(l -a Bl )~ log(l - a Bl - a Xl )) 

— (P(G 2 )A 2 + Ar) 

(log (1 - a B2 ) - log (1 - a B2 - a X2 ))} 

+(1 - a Al ){—P(Gi)q(S - S%\S - S « - {4 (1) , 
S-S^,S-S^-{x}^) 

—Ar log ( l-a B2 -a X2 ) 

1 — cm 2 — a J2 1 — a B2 

-(P(G,)A 2 + A,) log ( 1 -»«■-»- )) 

1 — cm, — ot Xl 1 — a Bl 

+(1 - a A2 ){-P{G 2 )q{S - S%\ S - Sf - W (2) , 

S-S^,S-S^ -{x}^) 

. , , 1 — ola 1 1 — ol Bi - a Xl 

1 — a Al — a Xl 1 — a Bl 

l 1D(ri A A 1 A A 1 ( ^ 01 A 2 1 CtB 2 OL x 2 , , 

-(P(G 2 )A 2 + Ai)log(- --- )} 

I — cm 2 — ol X2 1 — a B2 

> 0 (42) 

Once more, it could further be seen that if we define 

1 - ol Ai - a Xl 


oj 1 = max 


n M 2 


(41) 


w, = max 


1 — a Bl — ftj, 
1 OL A 2 OL x 2 

1 — ct B — a x 


(43) 



and 


set of sufficient conditions in the following manner: 


1 — a Bl 


Xi = maxy- -) 

1 — OfR, — a x , 


X 2 = max- 


otBi — a Xl 
1 - a Ba 


1 — «b 2 — a X2 
then as long as g(Sj,S w ) < 0 ,S W C Sj and 


(44) 


l.9l> 

max{\ 2 logwi + —A log (wiw 2 ), 

(Gi) 

A 2 logW2 + * log (utawi). 

P(G 2 ) 

A 2 logxi + l°g (xixi), 

A 2 log xi + log (xixi)} (45) 

we will have satisfied the diminishing returns property in¬ 
equality. In the next step we attempt to calculate the values 
of w}) w i>xi an d xi- We further note that cj', i £ {1,2} 
and Xj>3 ^ {1)2} represent the same set functions under 
different hypotheses, we could thus find the maximum 
values of either hj\ and x'j and then simply change the 
hypotheses index to derive the other. In the rest of this 
proof, we aim to find the maximal values of and xi ■ 

It could be shown that 


(1) q(Sj,S w ,S r ,S t ) < 0 

(2) g(S j} S w )< 0 

(3) |<7(S,-,^)|> 


maa;{A 2 logo;} + ^ log (u/^i), 


A 2 log ui 2 + 


P(Gi) 

Ai 


P(G 2 ) 

1 


log (wX)} 

1 


,UJ 1 = 


UJr, = 






KM 2 


Now that we have found the set of sufficient conditions 
for F(Sj) in both cases of Eq.(14) and Eq.(15), we can 
find the overall set of F(Sj) in all cases as: 


(1) q(Sj,S w ,S r , S t ) < 0 

( 2 ) giS^S^KO 

(3) |«?(5 J -,S 10 )|> 

maa;{A 2 logo;} + * log (u>[uj 2 ), 


A 2 log lo 2 + 


P(G 1 ) 

Ai 


P(G 2 ) 

, 1 , 

,uj 1 = -,w 2 = - 

TTMi ttm. 


log (wX)} 

1 


□ 


Xi = 1 + 1 --- (46) 

1 — (tflj — a Xl 

Thus, the maximum value of u>i occurs when a Xl and 
a Bl + a Xl are both maximum. This could occur when 
a x 1 = 7T1J, olb 1 + a. Xl = 1 — 7TMi- Then we would have: 


xi = 
xi = 


TTli + TTMi 
TtMi 

7Ti 2 + 7 Tm 2 

7TM 2 


(47) 


Similarly, it could be shown that 


/ 1 rt J 4 1 oc Xl 1 

w i = i 

I — a Bl - a Xl 1 — a Bl - a Xl 
< 1 — Tf 

1 — a Bl — ct a , 1 


(48) 


Thus, the maximum value of w} occurs when a Bl +a Bl = 
1 — 7 TMi- Then we would have: 


w 


/ 

1 


ijj 


/ 

2 


1 

TTMi 

1 

TM 2 


(49) 


Once again, we could see that as long as M > 2, we will 
have w} > xi and u> 2 > xi- We can thus rewrite the final 



